Analysis of Weather Conditions and their Relation to Traffic Accidents in the US#

City planners, civil engineers and car manufacturers have worked tirelessly for decades to make driving a safe, comfortable and accessible option for travel. Safer roads, smarter rules and cutting edge technologies have been implement to reduce the frequency of accidents with motor vehicles, to great success as accidents on the road have dropped tremendously since the 1960s according to IIHS. Aside from individuals with poor driving skills and faulty vehicles, one of the hindrances in motor safety has been the chaotic essence of mother nature. In the US, forty-two thousand people have died in 2022.

Some would argue that weather is primarily responsible for the amount of motor vehicle accidents. Weather phenomena such as hurricanes, heavy rain, icy roads and thick fog are able to severely debilitate a person’s driving skills and could pose a threat to their own safety, and that of other drivers around them. Others state that the weather is merely a small issue, and that accidents have many more causes other than weather. They argue that factors such as inexperience in driving, unmaintained roads or sloppy city planning cause just as much, if not more accidents on the road.

We set out to research the correlation between weather and accidents in the US. To achieve this we took a look at three datasets, these being ‘US Accidents (2016 - 2023)’, ‘Traffic Violations in USA’ and ‘Historical Hourly Weather Data 2012-2017’. ‘US Accidents (2016 - 2023)’ (Moosavi et al.) is a detailed record of vehicle accidents in the US between 2016 and 2023, having recorded the time, place and severity of accidents, among many other variables. ‘Traffic Violations in USA’ (Gutierrez) is a dataset containing a large amount of information about traffic violations and the accidents caused by them. ‘Historical Hourly Weather Data 2012-2017’ (Beniaguev) is a record of weather across the US between the years 2012 and 2017. This dataset has recorded the hourly weather status of many large cities across the United States. With these three datasets we can analyze and compare the link between heavy weather and car accidents across the US.

The First Perspective#

Driving through heavy rain or severe weather conditions can be disoriëntating and can lead to dangerous situations. This is why weather has a lot of influence on the frequency of accidents. Freezing temperatures and bad weather are a dangere to traffic. Cities with severe weather are more prone to traffic accidents than cities with less severe weather. To combat this issue, more precautions should be taken to prevent accidents caused by weather.

Accidents per Month#

When looking at weather data, it can seem very sporadic. One way to overcome this unpredictebilty is to look at monthly accident data, becouse months have a general pattern in weather data. For example, the summer months (June, July and August) are hotter and dryer than say the winter months (December, January, February). In order begin proving the first perspective, the data should show some difference in number of accidents per month. This is becouse weather for month to month differs drasticaly, so this is where the story should begin. In the following graph you can see the amount of accidents per month in the US.

It is notable to see that there is a significant increase in accidents during the months of august, september, october and november. An explaination for this observation could be that there are more people on the road during these months, this could be becouse of the summer holyday in the months july and august.

Effect of freezing#

When analysing this graph, some noticable features are the amount of accidents per datapoint rather than it being spread out evenly. This is because the only datapoints in the graphed dataset are days with freezing temperatures the datapoints are only of one state so the data is lower than graph beforementioned graph with all cities combined.

Number of accidents and Weather scores#

The two Bubble Maps are meant to put the correlation between our weather-score, and the amount of accidents in a city, into perspective. In the final version we want to make this 1 big plot instead of two smaller ones, but we ran out of time. We will also look into other variables that might have an effect on the er of accidents(like the total amount of roads in an area).

This visualization combines elements of a choropleth and a bubble plot, featuring a time-slide function and zoom controls. It aims to illustrate variations in accident frequency across cities throughout the four seasons of 2016 and 2017. However, the data shows significant disparities in total accidents between seasons, suggesting possible gaps in the dataset or substantial seasonal differences in accident occurrence. Additionally, the weather score does not appear to consistently influence accident numbers. Upon analysis, the map colors fluctuate frequently, particularly noticeable in states like Louisiana (on the right side), where accident rates remain high regardless of weather scores. These factors can potentially lead to misleading interpretations of the data.

This facetgrid illustrates how weather scores impact accident severity across various US regions, each characterized by distinct weather patterns. Each subplot represents a different region, showcasing the relationship between weather scores and accident severity. In each subplot, regression lines are included to assess correlation, yet none of these lines show a discernible pattern, suggesting no clear relationship between weather conditions and accident severity.

The Second Perspective#

Accidents are not the cause of the weather, but rather the causes of other factors such as the city, the state and the road condition. Weather can contribute, but is a minor factor as most vehicles are built to withstand most weather events. Urban infrastructure, regional traffic laws, and maintenance of roadways play significant roles in accident occurrence. Therefore, addressing these factors is crucial for improving road safety and reducing accidents.

Surrounding conditions#

In the following graph you can see that most accidents take place with no relevant infrastructure such as traffic stops or traffic lighs nearby. It is however notable that aside from the no infrastructure, most accidents take place at junctions or traffic_signs. These are also the places that are most susceptible to human error. We can conclude from this graph that the place where the traffic accident takes place is relevant to the cause of the accidents.

Traffic violations#

Another variable that could influence the amount of traffic accidents is traffic violations. Inexperienced drivers or drivers under the influence could be the majority of the causes of traffic accidents. The plot shown below visualises the correlations between traffic violations, and it visualises the weather score. The traffic violations index is shown on the left on the y axis and the weather score is shown on the right y axis. The data itself is grouped by month.

The Drivers#

Sources#

Fatality facts 2022: Yearly snapshot. IIHS. (2024, June). https://www.iihs.org/topics/fatality-statistics/detail/yearly-snapshot

Moosavi, S. (2023, May 28). US accidents (2016 - 2023). Kaggle. https://www.kaggle.com/datasets/sobhanmoosavi/us-accidents

Beniaguev, D. (2017, December 28). Historical hourly weather data 2012-2017. Kaggle. https://www.kaggle.com/datasets/selfishgene/historical-hourly-weather-data?select=wind_speed.csv

Gutierrez, F. (2017, October 31). Traffic violations in USA. Kaggle. https://www.kaggle.com/datasets/felix4guti/traffic-violations-in-usa